{ "cells": [ { "cell_type": "markdown", "id": "200badbc-9db1-45f8-ab23-b55a36f786de", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "# Loading Chromatogram Data" ] }, { "cell_type": "raw", "id": "7f6ada18-3d66-4064-a58c-5e36735be757", "metadata": { "editable": true, "raw_mimetype": "text/restructuredtext", "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ ".. currentmodule:: massdash" ] }, { "cell_type": "raw", "id": "aef105c5-fa5e-4a15-9040-66af75bb9f32", "metadata": { "editable": true, "raw_mimetype": "text/restructuredtext", "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "A Chromatogram Loader loads raw data from chromatograms. This leads to faster data loading since no on the fly extraction is needed however this leads to less flexibility. " ] }, { "cell_type": "code", "execution_count": 1, "id": "8a402b3f-e106-49b8-af66-cd95618581dd", "metadata": { "editable": true, "nbsphinx": "hidden", "slideshow": { "slide_type": "" }, "tags": [] }, "outputs": [], "source": [ "%load_ext autoreload\n", "%autoreload 2" ] }, { "cell_type": "code", "execution_count": 2, "id": "332bb4b3-e639-43e9-aab6-55d788215940", "metadata": { "editable": true, "nbsphinx": "hidden", "slideshow": { "slide_type": "" }, "tags": [] }, "outputs": [], "source": [ "# Please run this before executing any cell\n", "import os\n", "os.chdir(\"../../test/test_data/\") #### Insert path to data, this is the path to the tutorial data. " ] }, { "cell_type": "markdown", "id": "4956e403-6780-428a-9d5d-49ab28f5eb01", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "## Initiating a Chromatogram Loader" ] }, { "cell_type": "raw", "id": "88d418b1-ebd1-4e87-a8af-06d6b3620a06", "metadata": { "editable": true, "raw_mimetype": "text/restructuredtext", "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "All Chromatogram Loaders require the following inputs. " ] }, { "cell_type": "raw", "id": "dd832354-db54-4d65-b1aa-e362921f6e1c", "metadata": { "editable": true, "raw_mimetype": "text/restructuredtext", "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "\n", "1. dataFiles - a list of raw data files \n", "2. rsltsFile - a file containing the features" ] }, { "cell_type": "raw", "id": "1e3e2f4b-4e97-463c-bb4c-8ab42ea383e4", "metadata": { "editable": true, "raw_mimetype": "text/restructuredtext", "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "In the case of an :py:class:`~loaders.SqMassLoader` the data files must be a list of `.sqMass` files and the rsltsFile must be a `.osw` merged file output from pyprophet. This output is useful to visualize because it shows the exact Chromatogram which the OpenSwath peak picking uses for peak picking. " ] }, { "cell_type": "markdown", "id": "7217abb6-acb6-4f2b-875a-e34194fb8254", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "We can initiate a SqMassLoader object with multiple sqMass files as follows. " ] }, { "cell_type": "code", "execution_count": 3, "id": "bc1bc445-aabb-4e39-8d87-e63ee1fb1961", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [] }, "outputs": [], "source": [ "from massdash.loaders import SqMassLoader\n", "loader = SqMassLoader(dataFiles=[\"xics/test_chrom_1.sqMass\", \"xics/test_chrom_2.sqMass\"],\n", " rsltsFile=\"osw/test_data.osw\")" ] }, { "cell_type": "markdown", "id": "0b6ffb10-9b61-4252-b16c-dfc98d3407a0", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "
| \n", " | run | \n", "rt | \n", "intensity | \n", "annotation | \n", "
|---|---|---|---|---|
| 0 | \n", "test_chrom_1 | \n", "512.8 | \n", "1069.051908 | \n", "2274_Precursor_i0 | \n", "
| 1 | \n", "test_chrom_1 | \n", "516.4 | \n", "2230.982597 | \n", "2274_Precursor_i0 | \n", "
| 2 | \n", "test_chrom_1 | \n", "520.0 | \n", "2583.056921 | \n", "2274_Precursor_i0 | \n", "
| 3 | \n", "test_chrom_1 | \n", "523.7 | \n", "1876.955276 | \n", "2274_Precursor_i0 | \n", "
| 4 | \n", "test_chrom_1 | \n", "527.3 | \n", "1862.126603 | \n", "2274_Precursor_i0 | \n", "
| ... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
| 2697 | \n", "test_chrom_2 | \n", "1251.0 | \n", "0.000000 | \n", "b4^1 | \n", "
| 2698 | \n", "test_chrom_2 | \n", "1254.7 | \n", "42.001872 | \n", "b4^1 | \n", "
| 2699 | \n", "test_chrom_2 | \n", "1258.3 | \n", "20.999608 | \n", "b4^1 | \n", "
| 2700 | \n", "test_chrom_2 | \n", "1261.9 | \n", "20.999608 | \n", "b4^1 | \n", "
| 2701 | \n", "test_chrom_2 | \n", "1265.6 | \n", "0.000000 | \n", "b4^1 | \n", "
2702 rows × 4 columns
\n", "| \n", " | \n", " | intensity | \n", "
|---|---|---|
| run | \n", "annotation | \n", "\n", " |
| test_chrom_1 | \n", "2274_Precursor_i0 | \n", "2.139805e+06 | \n", "
| b4^1 | \n", "3.000697e+04 | \n", "|
| y1^1 | \n", "1.300780e+05 | \n", "|
| y2^1 | \n", "2.837481e+04 | \n", "|
| y3^1 | \n", "3.879062e+05 | \n", "|
| y4^1 | \n", "1.295312e+05 | \n", "|
| y5^1 | \n", "5.707377e+04 | \n", "|
| test_chrom_2 | \n", "2274_Precursor_i0 | \n", "5.931736e+05 | \n", "
| b4^1 | \n", "7.226959e+03 | \n", "|
| y1^1 | \n", "1.137597e+04 | \n", "|
| y2^1 | \n", "1.631975e+05 | \n", "|
| y3^1 | \n", "4.025936e+04 | \n", "|
| y4^1 | \n", "3.567035e+03 | \n", "|
| y5^1 | \n", "1.758796e+04 | \n", "